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Spatial Mixing for Independent Sets in Poisson Random Trees 


Varsha Dani* Thomas P. Hayes* Cristopher Moore* 


Abstract 

We consider correlation decay in the hard-core model with fugacity A on a rooted tree T in 
which the arity of each vertex is independently Poisson distributed with mean d. Specifically, 
we investigate the question of which parameter settings (d, A) result in strong spatial mixing, 
weak spatial mixing, or neither. (In our context, weak spatial mixing is equivalent to Gibbs 
uniqueness.) For finite fugacity, a zero-one law implies that these spatial mixing properties hold 
either almost surely or almost never, once we have conditioned on whether T is finite or infinite. 

We provide a partial answer to this question, which implies in particular that 

1. As d — > oo, weak spatial mixing on the Poisson tree occurs whenever A < /(d) — o(l) but 
not when A is slightly above /(d), where /(d) is the threshold for WSM (and SSM) on the 
d-regular tree. This suggests that, in most cases, Poisson trees have similar spatial mixing 
behavior to regular trees. 

2. When 1 < d < 1.179, there is weak spatial mixing on the Poisson(d) tree for all values of 
A. However, strong spatial mixing does not hold for sufficiently large A. This is in contrast 
to regular trees, for which strong spatial mixing and weak spatial mixing always coincide. 

For infinite fugacity SSM holds only when the tree is finite, and hence almost surely fails on the 
Poisson(d) tree when d > 1. We show that WSM almost surely holds on the Poisson(d) tree for 
d < e 1 /^/y/2 = 1.434..., but that it fails with positive probability if d > e. 
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1 Introduction 


Spatial mixing, or the decay of correlations between spins in a spin system, is a fundamental question 
of interest in statistical physics. It is intimately related to temporal mixing for the corresponding 
Glauber dynamics Markov chain, which means fast convergence to its equilibrium distribution. 
There are two flavors of spatial mixing: strong and weak (see Section 2.3 for definitions.) For 


our purposes, weak spatial mixing is equivalent to Gibbs uniqueness, another fundamental concept 
from statistical physics. 

The hard core model defines a distribution over the independent sets of a graph G in terms of 
a fugacity A > 0. When G is finite and A = 1, this is the uniform distribution. More generally, 
an independent set S has a probability proportional to Al s l, so that when A > 1, the distribution 
is biased towards larger independent sets, and when A < 1, it is biased towards smaller ones. By 
convention, when A = +oo, the conditional distribution on finite subgraphs is uniform over all 
independent sets of maximum size. 

In computer science, the problem of sampling from this distribution when A = 1 is well-known 
to be poly-time equivalent to the problem of approximately counting the independent sets of a 
graph, which is known to be a hard problem in general. We refer the reader to recent work by Sly 
and Sun [7] for further hardness results. 

A seminal paper of D. Weitz [10J found that the infinite regular d- ary tree has the same threshold 
for weak and strong spatial mixing, namely A = d d /(d — l) rf+1 ~ e/d. More importantly, this is 
a worst case: every other graph of maximum degree d + 1 also exhibits WSM and SSM for all A 
up to the aforementioned threshold. At the time, this established the strongest positive results for 
spatial mixing for a wide variety of graphs, including, for instance, the square grid. 

Brightwell, Hagrstrom and Winkler [2] showed that there are graphs, even trees, for which the 
property of WSM is non-monotone as a function of A. That is, increasing A can actually decrease 
the extent to which correlations travel over long distances, and so WSM holds at sufficiently small 
and sufficiently large A, but not in between. They even give a more complicated construction (not a 
tree) for which the hard-core model exhibits WSM iff A E (0, Ai] U [A 2 , A 3 ] where Ai < A 2 < A 3 < 00 . 

Restrepo et al. [ 6 ] showed that for some graphs, such as the planar square lattice, SSM occurs 
at higher A than for the 4-regular tree. Recently, Vera, Vigoda and Yang | 8 ] have shown that the 
tree of self-avoiding walks on the square lattice contains a subtree which has WSM but not SSM, 
at a still higher value of A, but still below the conjectured critical value for the square lattice. (See 
| 8 j Lemmas 4, 7].) This suggests that it may not be such an uncommon phenomenon for WSM to 


occur without SSM. In Section 2.5 we exhibit an example of an infinite tree which has WSM for 
all A > 0 but does not have SSM for any A > 4. 

We consider random Poisson trees, in which every vertex has an independent, identically Poisson 
distributed number of children. This is a natural model because of its connection to sparse Erdos- 
Renyi random graphs, G(n,p). When d = 0(1) and p = d/n, for large n, the local structure of 
balls of volume o(y / n) is well approximated by a Poisson tree. 

It is natural, given an infinite graph, to consider the following threshold conjecture: There is a 
threshold A cr i t such that WSM holds if and only if A < A cr it. The analogous conjecture with SSM 
in place of WSM is also interesting. For instance, both conjectures are known to be true with 

\ ■ - AA 
Ynt - _ ^A+l 

when G is the infinite regular A-ary tree. Note that A cr it is asymptotically e/A as A —> 00 . 
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Brightwell, Haggstrom and Winkler [2] have constructed other graphs G for which the WSM con¬ 
jecture is false. 

Understanding weak spatial mixing for regular d-ary trees is relatively straightforward. Note 
that, in general, the conditional probability a v that node v is unoccupied, given that the parent of 
v is unoccupied, obeys the recurrence 



(i) 


where w ranges over the children of v. Since for the d-regular tree all the a w are equal, the 
problem boils down to understanding the stability of the fixed point of the iterated function fd(a) = 
(1 + A a d ) _1 . 

For random Poisson trees, the situation is more complicated. Since the various subtrees of a 
node are no longer identical, but merely identically distributed, we now need to, in effect, consider 
a recurrence relation on distributions rather than on real values. 

Intuitively, we may expect a Poisson(d) tree to behave something like a regular d- ary tree. We 
show that this is the case for large d, proving that WSM holds for A = c/d if c < e but not if c > e. 
On the other hand, for small d, there are several ways in which this is not the case. In particular, 

1. There are some settings of the Poisson parameter and fugacity for which there is weak mixing 
(almost surely) but not strong mixing (with positive probability). In particular, this happens 
when the expected degree is 1.1 and the fugacity is sufficiently large. 

2. For sufficiently small d, but still greater than 1, the Poisson tree exhibits WSM for all values 
of A, even A = oo. 

3. One might have thought that the phenomenon exploited in [2], where increasing A causes 
childless nodes to be occupied with high probability, which then cuts off the flow of information 
from their siblings up through their parent, is pathological. In fact, we will see that, for small 
enough d, this phenomenon is pervasive in Poisson trees. 

4. As a consequence, some of our results are non-monotonic, in that for 1.179 < d < 1.434, we 
know WSM occurs at A = oo, and for sufficiently small A, but we don’t know what happens 
in between. 


Before summarizing our main results, we begin by observing the following zero-one law for 
spatial mixing on Poisson trees with finite fugacity. 

Theorem 1.1. For all d > 1 and 0 < A < oo, conditioned on Poisson(d ) being infinite, the 
probability that the hard-core model on Poisson(d) with fugacity A has WSM (resp. SSM) is either 
zero or one. 


Note that for d < 1, the Poisson tree Poisson(d) is almost surely finite. 

In light of this zero-one law (proved in Section 2.4) for finite A, we focus our attention on the 
question of which parameter settings (d, A) result in SSM, WSM, or neither. 

We summarize our results for finite fugacities. See Figure [l] for graphs of some of the functions 
involved. Overall, our results describe where WSM and SSM occur or do not occur in various 
regions of the (d, A) plane. 
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Theorem 1.2. The hard-core model with fugacity A < oo on Poisson(d ) has the following proper¬ 
ties, almost surely, conditioned on being infinite. 


1. 

2 . 


WSM if d < 1.179..., for any 0 < A < oo. 

[ M 2 4 (i i \2 when d < \/2 + \/5 

SSMif A< 

1 3+ 2 cP-t d otherwise. 


3. WSM if A < e ^ , as d —>• oo, 

IVo WSM if A = S±°£!2 ; as d ^ oo. 


Thus, if the WSM threshold conjecture is true for the hard-core model on the Poisson tree, then 
we have shown that the location of the threshold is, for large d, asymptotically the same as for 
d-regular trees. On the other hand, unlike d-regular trees, there is a range of parameters for which 
the Poisson tree exhibits WSM but not SSM. Specifically, for 1 < d < 1.179... and for sufficiently 
large A, the Poisson tree almost surely has WSM but not SSM, conditioned on being infinite; see 
Remark |5.2| We conjecture that the Poisson tree almost surely exhibits SSM up to a threshold 
that is asymptotically e/d , the same as for d-regular trees. 

We also study spatial mixing properties of the Poisson(d) tree when the fugacity is infinite. The 
following theorem summarizes our results for this case. 


Theorem 1.3. There exists a constant d* > 1 such that for all 1 < d < d*, the hard-core model on 
Poisson(d ) with fugacity A = +oo exhibits WSM but not SSM, almost surely, conditioned on being 
infinite. Futhermore, we prove that the largest such d* is at least e 1 ^/\[2 = 1.434..., and at most 
e = 2.718.... 


We conjecture that e is the correct value for d*. We prove Theorem 1.3 in Section [3j 


2 Preliminaries 

2.1 The Poisson Tree 

Let d > 0. Consider a recursively generated random tree T, where we sample a non-negative integer 
X from the Poisson distribution with mean d, namely, 

e~ d d i 

Ni > 0) ProblX = i) = —-— 

i\ 

and define X to be the number of children of the root of T. Recursively, let each of these children 
be the root of a subtree sampled independently in the same manner. We call this the Poisson tree 
of average arity d, and denote it by Poisson(d). 

For d < 1. this tree is almost surely finite. For d > 1, the tree is infinite with positive probability, 
but unlike an infinite d-regular tree, it has leaves: indeed, each non-root node has probability eT d 
to be a leaf. (The root itself is a leaf with probability e -rf (l + d), since it is a also a leaf if it has 
only one child.) 
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A=oo 


0 1 . 1.434 e 

(a) For A = +oo, there is SSM only when d < 1, in which case the tree is almost surely finite. There is WSM 
for d < 1.434..., but not for d > e. 




(b) For finite A, and d > 1 there is SSM in the shaded region (to the left of the red curve d = 1.179... and 
the blue curve that is asymptotic to 1/d). The threshold for WSM is asymptotic to the purple curve, which 
is also the threshold for the regular d-ary tree. 


Figure 1: Illustration for Theorems 


1.3 


and 


1.2 


Proposition 2.1. Let d > 0, and let T be a Poisson tree Poisson(d). For R > 0, let f(R ) denote 
the number of nodes in level R of T. Then, almost surely, 


oo R 2 d K 


Proof. By Markov’s inequality, for each R, we have Prob ^ ^ n > R < R 3 / 2 . A union bound 
implies that there are almost surely only finitely many exceptions. □ 
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2.2 The Hard-Core Model (Independent Sets) 

In Statistical Mechanics, systems involving large numbers of interacting particles are often modeled 
by a spin system. This is defined in terms of an underlying graph, often an infinite lattice, whose 
vertices are called sites, each of which can be assigned a spin from some finite set Q. A configuration 
is a function assigning a spin to each site. A Gibbs measure is a probability distribution over 
configurations that satisfies a consistency criterion on all finite “patches”, or subsets of vertices. 
Specifically, for each finite subset Acf, with boundary dA = {v E A | 3{u,u;} E Eq,w ^ A}, 
and each boundary condition a : dA —> Q, the conditional distribution of the Gibbs measure 
restricted to A, conditioned on agreeing with a on dA, is prescribed. Although it is known [3] 
that a Gibbs measure always exists, it is not, in general, guaranteed to be unique. Indeed, many 
spin systems undergo a phase transition, where some critical threshold for a defining parameter 
determines whether Gibbs uniqueness holds or not. 

In the hard-core model, the spins correspond to a site being “occupied” or “unoccupied”. 
Adjacent sites are not allowed to both be occupied, and so configurations are independent sets of 
the graph. Configurations have probabilities that are exponential in the number of occupied sites: 
an independent set S has probability ^Al 5 !, where A > 0 is a parameter of the system called the 
fugacity, and the normalizing constant Z is called the partition function. 

We will also be concerned with the case A = +oo, in which, on finite patches, the prescribed 
distribution is considered to be uniform over all independent sets of the maximum possible size. 

2.3 Weak and Strong Spatial Mixing 

“Spatial mixing” refers to a phenomenon wherein correlations between spins decay as the distance 
between the vertices increases. 

Let A be any set of vertices, let D A be a containing set of vertices and let a, r : d'h —> Q 
be two boundary configurations for the larger set. We are interested in the total variation distance 
between the marginal distributions on configurations over A, conditioned on agreeing with dorr. 
Now, consider infinite families of such triples ('L,ct, r), indexed by the positive integers. If 

dist(A, chU) —> oo implies \\pL% — /x^||a —> 0, 
then we say that weak spatial mixing (WSM) holds. If 

dist(A, a © r) —> oo implies \\n% — //^||a —> 0, 

where a © r denotes the set of vertices on which a and r disagree, then we say that strong spatial 
mixing (SSM) holds. 

Intuitively, weak spatial mixing requires the effect of changing some spins to decay with distance, 
assuming all closer vertices are unconstrained, while strong spatial mixing requires the effect to 
decay even when some of the closer vertices are “frozen” in an adversarial way (which must be the 
same for both boundary conditions). Obviously, SSM implies WSM. 

The above definition of weak spatial mixing is easily seen to be equivalent to Gibbs uniqueness 
(see |9l Proposition 2.2]). We note that several alternative definitions of spatial mixing appear in 
the literature. In some of these, the rate of decay of correlation is required to be exponential in 
the distance, rather than merely tending to zero. All of our results apply in this setting as well. In 
some definitions of spatial mixing, one either restricts attention to the effect on a single vertex, i.e., 
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A = {u}, and/or one restricts attention to boundary conditions that disagree on a single boundary 
vertex. In the case when the convergence rate is required to be exponential, and moreover the 
graph is such that boundary sizes grow subexponentially, the restriction to a single disagreement 
doesn’t matter (by a union bound). For trees, however, boundary sizes often grow exponentially, 
in which case the specific rate of exponential decay of the effect of a vertex would matter. On the 
other end, there are spin systems where restricting A to be a singleton makes WSM hold trivally, 
even when it does not hold for larger sets A. [j] 

Remark 2.2. In the case of independent sets, it is well known that SSM on a graph G is equivalent 
to WSM on all subgraphs of G , because any boundary vertices that are frozen to be unoccupied 
can equivalently be deleted, and any that are occupied can equivalently have all their neighbors 
deleted. 

For independent sets on a tree, there is a simpler characterization of spatial mixing in terms 
of non-occupation probabilities. Specifically, let T be a finite tree with a designated root vertex r. 
For each vertex v, let a v denote the conditional probability that v is unoccupied, conditioned on u’s 
parent (if any) being unoccupied. These non-occupation probabilities satisfy the recurrence ©• 
When T is an infinite rooted tree, we will suppose that an adversary has set arbitrary values 
a z E [0,1] at level R + 1. In this case, we treat ([!]) as a recursive definition for a v , where v is at 
distance < R from the root. If for all sequences of boundary conditions, as R —> oo, a v converges 
to a well-defined limit a*, then we call a* the non-occupation probability of v. 

Since the righthand side of ([Tj) is a decreasing function of each of the a w , it follows by induction 
that, for any radius R, the extreme values of any a v are induced by the all-zeros and the all-ones 
boundaries. Thus, when proving the existence of a*, it suffices to consider boundary conditions of 
this type. 

Proposition 2.3. For the hard-core model on any infinite tree, the following are equivalent: 

1. For all vertices v, there is a well-defined non-occupation probability a*. 

2. Weak spatial mixing occurs. 

3. There is a unique Gibbs distribution. 

Furthermore, when the fugacity, X, is finite, this condition is equivalent to the three above: 

4■ For the root r, there is a well-defined non-occupation probability a*. 

Proof sketch. The equivalence of statements 2 and 3 is shown in [9j Proposition 2.2], 

To see that statement 3 implies statement 1, let v be any vertex in the tree. By Gibbs uniqueness, 
if we consider larger and larger balls centered at v, the effect of the boundary configuration goes to 
zero, and there is a well-defined marginal distribution on the spins of v and its parent. Essentially 
by definition, a* must equal the probability that v is unoccupied, conditioned on its parent being 
unoccupied. Note that the effect of all spins outside the subtree under v can only influence the spin 
of v through the spin of its parent, which we have conditioned on. 

1 Here is a rather contrived example. Start with any 2-spin system for which WSM does not hold. Replace each 
vertex with a pair of vertices, and decree that if the original vertex had spin 1, the pair have the same spin, but 
uniformly random 1 or 2. If the original vertex had spin 2, the pair have opposite spins, again uniformly random. 
We omit the details. 
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To see that statement 1 implies statement 3, suppose for contradiction that there were two 
distinct Gibbs measures. Then their marginals must differ on some finite patch Ach Starting 
with the root vertex vq, let vo,vi,V 2 , ..., be a breadth-first traversal of the tree. Then, for some 
configuration a, and some finite i, the probability of a restricted to vo,.. ■ ,Vi, must differ under 
the two Gibbs measures. Choose i to be minimal with respect to this property. In this case, 
Prob(<r(uj) | <t(vo), • • • , u(vi-i)) differs under the two distributions. The only way this can happen 
is if the parent of V{ is unoccupied under a, in which case the above conditional probability must 
equal a* in both measures, a contradiction. 

Statement 4 is a special case of statement 1, corresponding to weak spatial mixing at the root 
(since the root has no parent). Hence statement 1 implies statement 4. Statement 4 implies 
statement 1 when A is finite, because the recurrence ([Tj) holds at every vertex v, under every 
boundary condition. It follows that the limit a* cannot exist unless the limits exist for every 
child vertex w. □ 

As before, note that for infinite A, recurrence doesn’t hold. Indeed, it is possible for a* to 
be completely determined by a finite collection of its descendants. For instance, if two children of 
v are themselves childless, then a* = 1 regardless of any other consideration. Thus statement 4 is 
weaker than statements 1 through 3 when A = oo. 


2.4 Zero-One Law 

In this section, we prove Theorem To this end, we say that a boolean predicate, S, defined 
on rooted trees has property 1Z if, for every tree T, S(T) holds if and only if S(T') holds for every 
induced proper subtree T' of T. Note that any predicate with property 1Z must hold for every finite 
tree, by induction. 

Examples: 


1. 

2 . 

3. 


“T is finite” has property 1Z. 

When A < oo, the property “The hard-core model for T has WSM,” has property 1Z, in light 


of Proposition 2.3 


Similarly, for A < oo, “The hard-core model for T has SSM” also has property LZ. 


Lemma 2.4. Let A be a predicate with property 1Z. Then, for a random Poisson(d) tree, conditioned 
on being infinite, the conditional probability that A holds is either zero or one. 


Proof. Let p denote the probability that A(T) holds. Since A(T) holds iff A(T') holds for each of 
the top-level subtrees T' of T, and the number of such subtrees is Poisson distributed with mean 
d, we have 


i>0 


—d 


= e d ( p ~ 1 \ 

i\ 


This equation is easily seen to have the following solutions, p = 1 is always a solution. When d < 1, 
this is the only solution in [0,1]. When d > 1, there is a second solution p* < 1, which equals the 
probability that Poisson(d) is finite. Since predicates with property 7 Z hold for all finite trees, it 
follows that p = p* + (1 — p*)q, where q is the conditional probability of A(T) conditioned on T 
being infinite. Hence q is 0 or 1, completing the proof. □ 
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Theorem 0 follows as an immediate corollary in light of the above observation that having 
WSM (resp. SSM) is a predicate with property 1Z. 


2.5 Alternating Trees 

Consider the infinite rooted tree T with alternating layers of degree d > 12 and degree 2 vertices, 
i.e., the root has d children, each of whom have two children, each of whom have d children and 
so on. In this section, we examine the question of weak spatial mixing for such trees. Notice that, 
since T contains a complete binary tree, it does not have SSM for A > 4. 

Theorem 2.5. T has weak spatial mixing for all A < 


Proof. Consider the function 

1 

1 + A (1 + Xx 2 )~ d 

which determines the values a w of the nodes w at an even depth r when the values of a w for w at 
depth r + 2 have been set to x. Since / is the composition of two monotone decreasing functions, 
it is monotone increasing. Observe that 


g'(x) = — g(x) 2 ■ A(— d) (l + Ax 2 ) d 1 • 2Ax = 2xg(x) 2 X 2 d (l + Ax 2 ) d . 

At the boundary, the adversary can set the a w s to any values in [0,1]. However, recall that by (fTl), 
for every level above that, these values will lie in 1]. Thus x and g(x) are both between 
and 1, and we have 

\d '( x )I - + (i + 

< 2 A 2 de -A(d+1)/(1+A)2 


Since d is large, when A < 4 ^, \d'(x)\ is bounded below 1 for all x in [yyy, 1]. It follows that g is 
a contraction mapping with a unique fixed point a* in 1 ], and moreover, for any x 6 [jxy, 1 ], 
the sequence { a n } defined recursively by 


do — x, a n — g(a n -i) 


converges to a*. 

Now suppose that the adversary sets the values of all the nodes at depth R to be either all Os 
or all Is. Then applying ([Tj) results in the same values at all the nodes at depth R' which is the 
deepest even level above R. applying the function g repeatedly from then on, we see that as R 
goes to infinity, the value at the root, a r converges to a*. By the monotonicity of <[T|) with respect 
to each a w , a r converges to a* for all settings of the nodes at depth R by the adversary. It follows 
that T has weak spatial mixing for all A < □ 

On the other hand, T contains the 2-regular tree as a subtree. Thus T does not have strong 
spatial mixing for any A > 4. Thus there is a large range of A for which it has weak, but not strong, 
spatial mixing. 







Now consider the infinite tree T' all of whose vertices at depth r have d(r) children, where 


d{r) 


2 if r is odd 

2 r+1 if r is even 


(or any increasing function of r on the even levels should be fine.) As before T' contains the complete 
infinite binary tree as a subtree, and so has no strong spatial mixing above A = 4. However, it is 
easily seen that T' has weak spatial mixing for all A. 


3 Infinite Fugacity: Maximum Independent Sets 

In this section we derive upper and lower bounds on the Weak Spatial Mixing threshold in the 
infinite fugacity case. We note that at infinite fugacity, the Poisson tree with average degree d does 
not exhibit Strong Spatial Mixing unless d < 1 in which case the tree is almost certainly finite. 

When A = oo, equation 0 is potentially indeterminate, so a good first step would be to re¬ 
examine the definition of the model. The defining notion is that, for any finite patch with boundary 
condition, the distribution should be uniform over independent sets of the maximum possible size. 
However, in order to understand whether this condition leads to a unique Gibbs measure, we still 
want a recurrence for the probabilities a v , that v is unoccupied, conditioned on its parent being 
unoccupied. 

There are a couple of good ways to deal with the indeterminism in (jT]) . First, we can do 
arithmetic in the ring M[A^ 1 ]/(A _2 ), where we treat A -1 as an infinitesimal, that can be ignored 
when added to any non-zero real number, and whose square is treated as zero. The expression 
1/(1 + A EL a w) evaluates to: 

1 . 1 whenever two or more of the a w are infinitesimal, 

2 . A -1 flu, a w 1 ^ none of the a w are infinitesimal, and 

3. 1/(1 + c w > Ylww^w' a w) if exactly one vertex w' has the infinitesimal value a w i = c w i A -1 . 

The second approach is to treat the above infinitesimals as zeros, but to reconstruct the coefficient 
c w ' in case 3, from the values on the children of w'. This gives the formula 


* U z a z + U w ^aJ 

where 2 ranges over the children of the unique child w' with a w > = 0. 

We will refer to vertex v as “large” when a v evaluates to a non-zero real number, and as 
“small” when it evaluates to an infinitesimal (or zero, if you prefer that viewpoint). There is a 
third possibility, namely that no finite piece of the tree suffices to determine whether a v is large or 
small, because of infinite descent; in this case, we say a v is “unlabeled.” Our rules above now give 
a particularly easy recursive description of when a node is large, small, or unlabeled: 

a. If one or more children of v is small, then v is large. 

b. If all children of v are large, then v is small. 
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c. Otherwise, no child of v is small, and at least one child is unlabeled. In this case, v is 
unlabeled. 

We call this process Karp-Sipser labeling , since it is a bottom-up version of the Karp-Sipser algo¬ 
rithm [3j, which generates an independent set S in a graph by choosing a vertex v of degree 1 or 0, 
placing v in S, and removing v and its neighbor, if any, from the graph. See Figure [2} 

Starting from the leaves, which are small, one can work upward through the tree, using rules 1 
and 2 to assign labels to all the small and large nodes. The nodes that remain unlabeled after this 
(infinite) process are the ones we called “unlabeled” above. It is easy to see that, by induction, 
each unlabeled node sits on top of an infinite leafless subtree of unlabeled nodes. The unlabeled 
nodes in this tree may also have additional children that are labeled large, who in turn have other 
children, about which we are not concerned. 

Now, suppose we cut off our tree at depth R, and set a pair of boundary conditions on these 
nodes, that respects the labeled nodes, and either occupies all or none of the remaining boundary 
nodes. More precisely, under the first boundary condition, the occupied nodes at depth R are 
exactly the ones labeled ” small,” while under the second boundary condition, the unlabeled nodes 
are also occupied. 

In this case, it is easy to see by induction that, subject to this new boundary condition, all the 
labeled nodes at depth < R will keep their original labels, and therefore the previously unlabeled 
nodes at depth i < R will either be all large or all small, depending on the parity of R — i and 
which of the two boundary conditions was set. 

Now let ps, Pl and pjj denote the probabilities that the root is labeled ‘small’, ‘large’ or 
‘unlabeled’ respectively. We have 

PS +PL+PU = 1 

Then, by rules a, b, and c above, ps is the probability that all the root’s children are large, 
while pl is the probability that at least one child of the root is small. Since each child is the root 
of an independently random subtree, which is distributed just as the entire tree is, the number of 
children that are large or small or unlabeled is Poisson-distributed with mean dpL or dps or dpu 
respectively. This gives 

p L = 1 - e~ dps and Ps = e~^ Ps+Pu) = e~ d(1 ~ PL) (2) 

Together, these imply 

—de~ dp S 

PS = e 

Letting / denote the function f(x) = e~ dx , we see that ps is a fixed point of f o f. One fixed point 
of / o / is the (unique) fixed point of /. Using Lambert’s W function, where z = W(z)e w ( z \ this 
fixed point can be written as W{d)/d. In fact, when d < e this is the only real fixed point. In that 
case, 

p s = W(d)/d, and p L = 1 - f(p s ) = 1 - p s 

so that pu = 0 and the root is labeled with probability 1. When d > e, on the other hand, the 
smallest real fixed point of / o / is strictly smaller than W(d)/d, and is not a fixed point of /. In 
that case, the smallest fixed point is ps, and hence pjj > 0, i.e., with constant probability the root 
remains unlabeled. 

We remark that all this corresponds exactly to the rigorous results on the Karp-Sipser algo¬ 
rithm ®m- On G{n,p = d/n), if d < e then the algorithm finds a maximal independent set, except 
for a core that consists w.h.p. of O(logn) vertex-disjoint cycles. 
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We are now ready to prove our upper and lower bounds. 


3.1 Upper Bound 

In this section we will analyze the situation when A = oo and d > e. Recall that in this case 
the root has positive probability pjj to be unlabeled. Moreover, regardless of the root’s label, the 
number of children of the root that are, respectively, small, large and unlabeled are independent 
Poisson random variables with parameter, respectively, dps, dps and dpy. 

It follows that with positive probability, the root is unlabeled and has at least two unlabeled 
children (and no small children.) In this event, based on the parity of R, one boundary condition 
at depth R forces both those unlabeled children to be occupied while the other forces them both 
to be unoccupied. Since the independent set must be of maximum size, if both are occupied then 
the root is forced to be unoccupied, while if both are unoccupied, the root is forced to be occupied. 
Since these two alternatives remain possible, independent of R, there is no weak spatial mixing at 


the root. We have shown the following, which implies the second half of Theorem 1.3 


Theorem 3.1. For A = oo and d > e, with positive probability, the Poisson(d ) tree does not have 
WSM at the root. 


We remark that if the Poisson tree is infinite, it almost surely contains some node which is 
unlabeled and has at least two unlabeled children. 


3.2 Lower Bound 

In this section we analyze the situation when d < e. Recall that in this case ps = W(d)/d was the 
unique fixed point of f(x) = e~ dx , so that 

PS = e~ d ™ (3) 

and by ([2]), ps = 1 — e~ dps = 1 — ps, i.e., the root is labeled as either ‘small’ or ‘large’ with 
probability 1. Moreover, this labeling obeys the rules that 

• all children of a small node are large, and 

• at least one child of a large node is small. 

To what do these labels correspond? Intuitively, a vertex being labeled ‘small’ or ‘large’ re¬ 
spectively, corresponds to having non-occupation probabilities (as the root of its subtree) that are 
small or large respectively. For finite A 1, roughly speaking, this means 0(1/A) or 0(1) respec¬ 
tively. Note however, that this intuition can sometimes be incorrect, for instance a node with very 
many children, all “large,” may have a large non-occupation probability, even though it receives 
a label of “small.” Another example where the above intuition fails is for nodes at the root of a 
subtree isomorphic to a very long path, specifically one of length A). Although the nodes in this 
path are labelled with alternating “small” and “large” labels, actually almost all the conditional 
non-occupation probabilities will be approximately l/\/X 

When A is infinite, this becomes a distinction of zero vs. non-zero. In other words, conditioned 
on its parent being unoccupied, (or equivalently, looking at it as the root of it subtree), if vertex 
v is labeled ‘large’ then there are maximum independent sets on its subtree which do not contain 


12 



v ( i.e ., there are configurations in which v is unoccupied), and a v > 0, whereas if v is labeled 
‘small’ then every maximum independent set contains v (i.e., v is occupied in all configurations) 
and a v = 0. 

Now consider a ‘large’ node v with two or more ‘small’ children. Looking at the recurrence 0 > 
and the rules for arithmetic in the ring M[A 1 ]/(A 2 ), we see that regardless of the non-occupation 
probabilities of all the other children of v, a v = 1. 

In other words, if v has two or more children that are probably occupied, then v is probably 
empty, regardless of what other children it has. We say in this situation that a v is known. More 
generally, we say that for ‘large’ v, a v is known whenever it is determined by a finite subtree of 
u’s descendants. In particular, known a v s are rational. For technical reasons, we will not say a v is 
known for all v that are ‘small’, but rather only those v all of whose children are known 

Let kl and ks denote the probability that a v is large and known, or small and known, respec¬ 
tively. If a v is large, it is known either if it has two or more small children, or if all its children 
are known and exactly one of them is small. If a v is small, then it is known if and only if all its 
children (which are large) are known. This gives us the equations 


kl = 1 — (1 + dps) e dps + dns e dps e d( ' PL KL 
K S = e ~dp S e -d(PL-K L ) . 

Simplifying and combining with ([3]) gives the relations 

PL- k l = d(p 2 s - Kg) 

k s = p s e~ d ( pL ~ KL) . 

Rearranging terms and once again using © we see that 

kl = 1 (IT dps)ps + de~ 2d{ ' l ~ K ’ L ^ 


(4) 

(5) 


( 6 ) 

(7) 


so that kl is a fixed point the function 

g(x) := 1 - (1 + W(d))^f + de~ 2d ^~ KL) . 

The system of equations ([6]) and Q always has (kl, ks) = (pl,Ps) as one solution. Additionally, 
when d is sufficiently large, there is a second solution where kl < Pl and ks < Ps, corresponding 
to the fact that for large enough d < e there are graphs for which even though the root v is labeled 
“large”, the actual value of a v is not determined by any finite subtree of the Poisson tree. The 
threshold where these roots appear is the d such that 

9 (Pl) = 1 = 2d 2 p|, 


which with ([2]) implies 

e i/V2 

d= = 1.434.... 

V2 
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4 Finite A case: Lower bound 


In this section we will derive a lower bound on the SSM threshold for the Poisson tree. This proves 
part [2] of Theorem 


By Remark 2.2 


1.2 


in order to show SSM, it suffices to show WSM for any subtree of Poisson(d). 

Let T be a subtree of Poisson(d) and let r be the root of T. For R > 0, let Tr denote the 
truncation of T to depth R, and let dT R denote the boundary of Tr, i.e., the vertices of T at 
depth R. We want to study the influence of the non-occupation probability values at dT R (set 
adversarially) on the value of a r . For notational convenience we will require the adversary to set 
the values at dT R from [yyy, 1]. Since the range of the function x ^ 1+ ^p, r on [0,1] is contained 
in [yyy, 1] when 0 < P < 1, this corresponds to allowing the adversary to set values in [0,1] on 
dT R+1 . 


Recall, from Proposition |2.3| that to show WSM for T, it suffices to show that there is a 
well-defined non-occupation probability a* at the root r of T. This, in turn, would follow if the 
non-occupation probabilities induced at r by setting the vertices in dT R+ \ to all zeroes or all ones 
converged to the same value as R —> oo. 

Let w be a vertex of dT R . Suppose the values at all the other vertices in d Tr are fixed, and 
only the value a w at w is varied. Let a P’"- 1 /( 1+A )] and a j, au,_1 l be the values of a r when a w is set to 
yyjy or 1 respectively. Then by the mean value theorem, 


a [o w =l/(l+A)] _ a [a w =i] 


< max 




da r 

da w 


Now, if o® and a * are the values of a r when the vertices at depth R + 1 have been set respectively 
to all zeroes or all ones (he., the vertices in dT R set to all ones or all yyy) then by varying the 
values at the boundary vertices one at a time and applying the triangle inequality, we see that 


la? — all 


< 


E 

w£dTn 


a w E 


max 

i 


l+A 


d] 


da r 


da v 


( 8 ) 


Fix w £ <9Tr and let r = wq, w\, . .. w R -±,w R = w be the path from the root to w. Let a* = a Wi 
and let Pi = fL a x where the product is taken over all the children x (if any) of Wi other than 
Wi- |_i. Then for all i, 

_ 1 

1 + Adj+iPj 

Note that a* > yy^ > 0 for al i. Differentiating with respect to a* + i, with some algebraic 
manipulations, we have 

dcij _ -A Pi _ -aj( 1 - cij) 

da i+ 1 (1 + Xa i+ iPi) 2 a i+ \ 

Repeatedly applying the chain rule, we see that 


da r 

da w 

Since yyy < CLi < 1 , 


a R ~ 1 a 

uclq t r uCL' 


R -1 




1 CLi) 


R-l 




"« 4 


da r 


da v 


R -1 


R -1 


P if 


- Cli 


(9) 
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Note that a* > 1+ / a . +1 , so that (1 — aj)(l — aj+i) < Aai j f '_J_^ a .^ I 1 +1 ^ • To bound the partial 
derivative, we want to maximze this subject to the constraint that a«+i > yT_. 

Consider the function x i—> ^^xx'* on ' n t erva l [yyy- !]• Differentiating, we see that when 
A > 1+ 2 V ^ , it is maximized at and that the maximum value is 1 — ^ (\/l + A — l). Thus 

(1 — dj)(l — dj+i) < 1 — j (\/l + A — l). Applying this to consecutive pairs in n^o^l — a i)> we 
have, for even R 


da r 


dan. 


R -1 


^ O / _ N \ ^/ 2 

< (1 + A) (1 ~ a t ) < (1 + A) ( 1 — y (\/l + A — 1 

i=0 


( 10 ) 


On the other hand, if A < 1+ 2 V ^ , then the derivative of A ’^+a^ ) never zero i n [yvj-1], and the 
function is maximized at yT^. Thus (1 — dj)(l — ai + \) < ( 1 + a) A (i+2A) ’ an( l once again, applying this 


to consecutive pairs, for even R, 

da r 


dan. 


<(1 + A) 


A 2 


(1 + A)(l + 2A) 


R/2 


Let us now re-examine ([ 8 ]). We have 


|a? - all < > max 

— z J c [ i ii 


da r 


dan. 


< \dT R \B x , R 


( 11 ) 


( 12 ) 


da r 


da u 


where B\ jR is an upper bound on 

Since T is a subtree of a Poisson(d) tree, it follows from Proposition 2.1 that, almost surely, for 
all sufficiently large R 

\dT n \ < R 2 d R . (13) 


If A > 1+ ^ then substituting B\ tR = (1 + A) (l — | (\/l + A — l ))^ 2 into (12), we have 


R/2 


d Ml — v V1 + A — 1 


|a(? — a* | < R 2 d R (l + A) ^1 — — ^x/T+T _ l) J =# 2 (1 + A) 

which goes to 0 as R — > oo as long as d 2 (l — | (a/ 1 + A — l)) < 1, i.e., A < 

If A < l+ if^ then substituting B\ jR = (1 + A)A K (1 + A)~ R / 2 (1 + 2\)~ R / 2 into (12), we have 


R/2 


«r — I < R 2 d K ( 1 + A) 


2 aRr 


X R 


= R\l + X) 


d 2 A 2 


_ (1 + A) (1 + 2A) _ 


R/2 


(1 + A) R / 2 (1 + 2X) r / 2 

which goes to 0 as R — > oo as long as d 2 A 2 < (1 + A)(l + 2A), i.e., X < 3+ ^ d 2 ^ d ~- ■ 

The transition point, A = corresponds to d = \J2 + \/5 which is approximately 2.058. 
Thus we have shown WSM for independent sets with fugacity A on any subtree T of a Poisson(d) 
tree, when 

J przyp when d < y/2 + a/5 
< 1 3+ /J- 4 d2 otherwise. 


By Remark 2.2 we have SSM for Poisson(d) for A in the same range. 
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5 Mixing for small d 

In this section we prove part [l] of Theorem |1.2[ which we now restate in an equivalent form. 

Theorem 5.1. For all d < 1.179..., the Poisson(d) tree almost surely has weak spatial mixing for 
all finite A > 0. 

Proof. Recall our formula for the influence of a leaf w along the path v = vq,v\,... ,vr = w: 

r -1 


d In a v 
<9 In a w 


IK 1 

i=l 


Cli 


(14) 


We claim that the existence of this path tells us nothing about the other branches of the tree that do 
not survive to depth R. In particular, the number of childless children of each v% for 0 < i < R — 1 
is independent, and Poisson-distributed with mean p = de~ d . 

The presence of these small leaves gives us a better upper bound on 1 — a*. In particular, if Vi 
has Cj childless children, then 

A 


1 — a-i < 1 — 

Thus w' s expected influence is at most 
R -2 

e TT — 

pa 11 l 


l 


A 


{ci} (1 + A)* + A 


= E 


1 


A 




c (1 + A) c + A 


R-l 


< 


+ (1 - e _A1 ) — 


1 +A 


1 + 2A 


R-l 


The expected total influence of all the leaves is this times d R , which is exponentially small if 


+ (1 - e-") — 


1 + A 


1 

1 + 2A < d ' 


The left-hand side is monotonically increasing with A, so this inequality holds as long as 

1 + e - ** 1 

2 < d' 

Substituting p = deT d , we hnd that this holds for all d < 1.179. 

We have made no attempt to optimize the constant in Theorem |5.1[ 
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Remark 5.2. Note that for any d > 1, there is a A for which Poisson(d) tree lacks strong spatial 
mixing. The reason (as pointed out to us by Allan Sly) is that it possesses, with positive probability, 
subgraphs that are “stretched” versions of the infinite binary tree, which branch every c generations 
for some constant c. See Figure [3j Such trees lack weak spatial mixing for sufficiently large A, since 
if 

1 . .. , , 1 


hip) = 


and f 2 {a) = 


the function 


1 + Aa ' 1 + Aa 2 

/i(/i(---(/i(/ 2 (a))))) 


c times 


has a stable period-2 orbit for sufficiently large A. 
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Figure 3: An example of a stretched binary tree with c = 3. 


6 Non-mixing just above the threshold 


In this section we will prove that, for sufficiently large but constant d, the Poisson(d) tree lacks 
spatial mixing just above the threshold for d-regular trees. First note that the latter is 


d d 

(d- l) d+1 


- d + o(i/df. 


Note that for z £ [—1,1], 

1-z < < 1-z + z 2 (15) 

Let v be a vertex at level L — 1. By ( |15[ ) and the definition of a v ([T]) , we have 

1 — A a w < a v < 1 — A a w + A^ a^, 


where the product is over the children w of v, which are at level L. Taking expectations, we have 


1 - AE 



<Ea„ < 1 — AE 



+ A 2 E 



(16) 


Let cll denote the non-occupation probability of a generic vertex at level L, in a Poisson tree 
truncated at depth R. (Note that these are independent and identically distributed.) Let K ~ 
Poisson(d) denote the number of children of vertex v, and let a\, a, 2 , ■ ■ ■ ax denote the non-occupation 
probabilities of these children. Then the a^s are independent of each other and I\ and each has 
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expectation E ai,. So 


E 


Similarly, 


Substituting into (16), we have 


n 


= E 

= E 

= E 


E 

' K 


K 


n«*i* 

i =1 

n>M k \ 

i =1 

[pEo L ] 

k\ 


l K 


°° 

Eirf'd 

fc =0 


_ e -d(l-Eai,) 


E 


n 


= p -d(l-E a|) 


1 - Ae- d(1 ' Eai) < Eo L _i < 1 - Ae- d(1 - Eai) + A 2 . 


(17) 


If we define 


we can rewrite © 


4>q(z) = 1 - Ae 2) + qX 2 , 


</>o(lE ol_i) < E a L < 4 > q (E a L _i), 


where 

q = e - d (i- Ea i) e [o,i]. 


(18) 

(19) 


The following lemma shows that for A just above e/d, even if an adversary controls the second 
moment Ea 2 and hence the coefficient q of the quadratic term, this function oscillates between 
two disjoint intervals. It follows that the expected occupation probability at the root alternates 
between high and low values based on the parity of the depth of the tree, implying a lack of spatial 
mixing. 


Lemma 6.1. For fixed X, d, and q G [0,1], let 4> q (z ) be defined as in ( 18). Let X = c/d where c > e 
is a constant. Then there are constants d*, b±, and 62 such that, for all d > d* and all q G [0,1] , 


Vz > 1 — b\jd : <j) q (z) < 1 — 62 /d 

Vz < 1 — 62/d : 4 > q (z ) > 1 — 61/d. 


and where b± <62. 

Proof. Since <f>o is monotonically decreasing, it has a unique fixed point zq = </>o(~o)> namely 

zq = 1 — where 6 q = W (Ad) = W (c). 
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Here W(x) is Lambert’s function, i.e., the unique positive root y of ye y 
— W(dX). If c > e then W(c) > 1, making this fixed point unstable. 

To focus on cj)q s behavior near zq we change variables, setting z = zq 
to z is equivalent to applying ip q (d) to d, where 

ip q (d) = d ■ ((j) q (z 0 + d/d) - zo) = -(e s - 1) W(Xd) + q\ 2 d = -(e 5 

Since ^q(O) = — W(c) and ip o is analytic, for any constant 1 < A < W(c), 
such that 

V5e[-M]: g' 0 (S)<-A. 

Therefore, for any 5* < 5 we have 

yd > 5* : ip 0 (5) <-Ad* and V<5 < -<5* : ip 0 (d) > Ad* . 

Choose such an A and such a <5* with 6* < bo. Finally, since ipo(d) < ip q {d) < ipo{d) + c 2 / d, if d > d* 
is sufficiently large so that 

C /<(A-IW. 

the proof is completed by setting b\ = bo — d* and 62 = bo + d*. □ 

7 Asymptotically Optimal Lower Bound 

We saw in Section [ 6 ] that asymptotically, for large d the Poisson(d) tree does not have weak spatial 
mixing for A just above e/d, which is the asymptotic threshold for WSM (and SSM) for the d-regular 
tree. We will now show that below e/d the Poisson(d) tree almost certainly does have weak spatial 
mixing. Specifically we will prove the following result, which is equivalent to part[3]of Theorem |1.2| 

Theorem 7.1. For all 7 6 (0,1), for all sufficiently large d, the Poisson(d) tree with activity 
A = (1 — 7 )e/d exhibits weak spatial mixing with probability 1. 

The proof is fairly involved, and we begin by presenting a summary of the main ideas involved. 

Proof Sketch. To show WSM we need to show that there is a well defined non-occupation probability 
a* at the root, i.e., that the sequences a® R and a* R converge to a common limit. As in Section|4] 
we bound \a-® R — a l R \ by the sum of the absolute values of the partial derivatives da r /da w where 
re is a vertex at depth R. We know that there are almost surely at most R 2 d R such vertices, for all 
sufficiently large R. The improvement in this argument comes from proving a better upper bound 
on n„(l-a v ) which controls the size of \da r /da w \. Here, the product is taken over all vertices v 
on the path from r to w. The main idea is that when d is very large, most of the vertices on the 
path from r to w are “good” in the sense that they and all their descendants to some depth h have 
degrees very close to d. In other words, each such vertex v is the root of a nearly regular d-ary 
subtree of depth h. For large enough h, this means that a v is very close to the fixed point a* of the 
function fd(x) = (1 + Ax rf ) _1 , which exists since A is less than the regular d-ary threshold. Thus 
for each good vertex v, (1 — a v ) < c/d for some small c < 1 and it only remains to show that there 
are almost surely enough good vertices that, for all sufficiently large R, the bound fL(l — a v) for 
each path to depth R beats the R 2 d R such paths. □ 


= x. We have f Q (x 0 ) = 
+ d/d. Then applying cp q 

-1 )W(c)+ q -^. 
there is a constant <5 > 0 
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We devote the rest of this section to making the above argument rigorous. 


Remark 7.2. Unlike the proof in Section|4j this proof does not show strong spatial mixing. Passing 
to a subtree can destroy the property that most vertices have nearly d- ary subtrees to some depth 
(or even that they have degree close to d). Given the results in Section |4j it is an open question 
whether SSM holds with high probability for A between 1/d and e/d. 


The proof of Theorem 7.1 rests heavily of the fact that most of the vertices in the Poisson(d) 
tree are roots of subtrees (to some depth) that are almost d- ary. In order to make precise what we 
mean by “almost d-ary”, we will first need some definitions. 


Definition 7.3. An (a, 6)-tree is an infinite rooted tree in which every vertex at an even depth has 
a children and every node at an odd depth has b children. A truncated (o, 6)-tree is the truncation 
of an (a, 6)-tree to some finite depth R. 

Definition 7.4. Let 0 < Ai < A 2 . A rooted tree T is [Ai, A 2 ]-regular if the number of children 
of every vertex is in [Ai, A 2 ]. 

By an almost d- ary tree, we will mean a [(1 — e)d, (1 + e)d]-regular tree. In what follows we 
will show that such a tree behaves like a d- ary tree, in that if the tree is sufficiently deep, then for 
almost the same range of A as for the d- ary tree, the non-occupation probabilities converge to well 
defined value at the root. 

Our next result gives us a way to find a (Ai, A 2 ) tree and a (A 2 , Ai) tree “near” any [Ai, A 2 ]- 
regular tree. See Figure [4] for illustrations. 

Lemma 7.5 (Pruning/Grafting). Let T be a [Ai, A 2 \-regular tree with root v and depth R. Then 

1. T can be transformed into a truncated (Ai, A 2 )-tree T' of depth R, rooted at v, by pruning 
(removing children along with their entire subtrees) at even levels and grafting (adding children 
together with an appropriate subtree) at odd levels. 

2. T can be transformed into a truncated (A 2 , Ai )-tree T" of depth R, rooted at v, by grafting 
at even levels and pruning at odd levels. 

Let a v , a' v and a” denote the non-occupation probabilities at the root in T, T' and T" respectively, 
when all their leaves are set to the same value a 0 E [0,1] . Then 


a v A a v A a v 


Proof. By induction on depth of T. □ 

Recalling that 

^ d ^ ~ 1 + A a d ’ 

we wish to prove, for certain values of A, that iterating /A, o /A 2 causes a v to converge to a unique 
fixed point. The following two lemmas establish the existence and uniqueness of this fixed point, 
and bound its location. 
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(b) Two branches pruned, one grafted. 


(c) The resulting (l,3)-tree. 



(d) Opposite parity of levels. 


(e) The resulting (3, l)-tree. 


Figure 4: Applying Lemma 7.5 Old subtrees are pruned and new ones grafted on, on alternating 
levels. 



Lemma 7.6. Let Ai, A 2 > 2, and let 

(2o) 

For any A < A(Ai, A 2 ), there is a unique fixed point a* such that (/a, o /a 2 )(o*) = o*. Moreover, 
there is a constant c < 1 such that 

|(/Ai 0 fA 2 )\ao) - o*| < c* _1 In (A + 1) . 
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Moreover, 


c<-U + 


A(Ai, A2) 

Proof. We will begin by changing variables. First define y = In a, in which case y E (— 00 , 0] and 

9d{y) = — hi(l + Xe dy ). 

Note that gA x o g\ 2 is monotonically increasing. We will show that, for any A < A(Ai, A 2 ), there 
is a constant c < 1 such that 


d 

d y 


gA 1 {gA 2 (y)) = 9^(9 a 2 (v)) g'/\ 2 {y) < c for all y < 0 . 


( 21 ) 


This implies that the fixed point y* = (g^ o gA 2 ){y*) = hi a* is unique, and that we approach 
it exponentially quickly as we iterate gA 1 ° gA 2 ■ Rather than finding c as a function of A, it is 
analytically simpler to find a A such that (21) holds for a given c, and then showing that this A 
coincides with A(Ai, A 2 ) when c = 1. 

It is convenient to do one more change of variables, from y to gf^(y) (which is well-defined since 
gd is monotonic). Thus we can focus on 


Hy) =9 'a.(v)s\,(SaM = AiA 2e Al »(l -e») 


A 


1 + Ae A i y 


We will find a A such that h(y) < c for all y < 0. For any fixed y, h(y) is a monotonically increasing 
function of A. Moreover, we can find the A where h(y) = c, namely 


X c (y) = 


ce 


-Ai y 


AiA 2 (1 - ey) - c ’ 


where we note that if AiA2(1 — e y ) < c then h(y) < c for all A > 0. Taking derivatives, we find 
that A c (y) is minimized at 

AiA 2 - c 

2/min = In ■ 


(1 +Ai)A 2 ’ 


where 


Ac — A c (y m in) — CA2 


Ai + 1 
AiA 2 — c 


Ai + l 


( 22 ) 


Thus if A < A c , we have h(y) < c for all y < 0. 

Now note that A c is a strictly increasing function of c, and that it ranges from 0 to A(Ai, A 2 ) 
as c goes from 0 to 1. Thus for any 0 < A < A(Ai, A 2 ) there is a c = c(A) < 1 such that A = A c , 
and (21) holds. Specifically, an easy calculation shows that d 2 A c /dc 2 > 0 for 0 < c < 1, and that 


1 


dA 


A(Ai, A 2 ) dc 


C= 1 


Ai(A 2 + 1) 
AiA 2 — 1 


< 2 . 


(Indeed, this derivative is 1 + 0(1/A 2 ).) Therefore, 


A c > A(A 1 ,A 2 )(l-2(l-c)) , 
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and so 


c <^ii + 


A 


A(A 1 ,A 2 ) 

To complete the proof, each time we iterate gAi ° 5 a 2 ; an y interval shrinks by a factor of c. 
Since g Ai o g^ 2 maps (—oo,0] into (—ln(A + 1), 0], the width of any interval after t iterations is 
at most c t_1 ln(A + 1). The same bound holds when we change variables back to a = e y , since 
de y /dy < 1 for all y < 0 . □ 


Note that when Ai = A 2 = A, the value of A defined in Lemma 7.6 becomes the known value 
for the A-regular tree, 


A(A, A) = A a 
W e will also use the following lower bound, 


A + 1 
A 2 - 1 


A(A l5 A 2 ) > Aj 


Ai + 1 
AiA 2 


Ai+l 


A+l 


1 

a 2 


A z 


(A — 1) A+1 ' 


1 + 


Ai 


Ai+l 


> 


a 2 


(23) 


Lemma 7.7. Let 7 € (0,1) and let A = AL-hA. Let e = 7 2 / 4 . There is a constant do = do( 7 ) such 
that for all d > do, the fixed point a* of /(i_ £ )d o /( 1+e ) d is at least 1 — ( 77 ^ 7 - 

Proof. As before, we change variables to y = In a, and consider the fixed point of 9(\- £ \d o g(\ +£ )d 
where gd(y) = — ln(l + Xe dy ). First, we show the conditions of Lemma 
definition of A(Ai, A 2 ) from (20). Since e = 7 2 /4 < 7 we have 


7.6 


are met. Recall the 


A = (l~7)e < (1 ~e)e < 


d 


d 


(1 + e)d 


< A((l — e)d, (1 + s)d) 


where the last inequality follows from (23). 


Now that we know that o (j(\ +e yi has a unique fixed point, it suffices to show that for 


y (l+e)d 


9(1—e)d (d(l+e)(i (l/)) — V ■ 


(24) 


In that case, the fixed point a* is at least e y >l + y = l— ( 7777 - Since 


x 


— x < — ln(l + x) < —x (^1 — , 

whenever x > 0 , we have 

9(i+e)d (y) = - ln(l + \e {1+£)dy ) = - In (1 + A/e) < -(A/e)(l - A/2e) 

and for any z, 

g { i- E )d(z) = — ln(l + Xe^ dz ) > -Xe^ dz . 

Since 9(\- e )d is monotonically decreasing, recalling A = — , we have 

9(1— e)d ^5(l+e)d ^ ^ ^ — 5 ( 1 — e)d ( — (-^/ e ) (1 — 'V^ 6 )) 

> _ - \ e - rf ( 1 -e)(Ve)(l-A/2e) 

= (l^H e l-(l- £ )(l- 7 )(l-^) . 
d 
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Thus to prove (24) it suffices to show that 


> (1 - ^gMi-eXi-TKl-^) 

1 + e “ v ’ 

or equivalently, setting e = 7 2 / 4 , 

- ln(l + 7 2 / 4 ) > ln(l - 7 ) + 1 - (1 - 7 2 / 4 )(l - 7) ^1 - ( 25 ) 

We choose do such that 7 ^ < 7 3 / 3 . Then, recalling that ln(l — 7 ) = — 7 */*, for all d > do, 

we have 

1 + ln(l - 7 ) + ln(l + 7 2 / 4 ) < 

< 

< 

< 


% 4 

7 2 7 3 
1 — 7- 

1 4 3 

i-7- - i—r 

d-7 2 /4)(l-7)(l-T^) 


which implies (25). 


□ 


Let R and a\ R denote the non-occupation probabilities at the root of the Poisson(d) tree with 

activity A = — when the vertices at depth R are all occupied or all unoccupied respectively. 
We are now ready to prove 


Theorem 7.8. For all 7 G (0,1), for all sufficiently large d, for all 5 £ (0,1), there exists Ro such 
that 


Pr 


((Vi? > i?o)|a“ ii? - a**| < e^ 2R / 56 ) >1-8. 


Fix 7 G (0,1). Let A = ^ , and, as before, let e = 7 2 / 4 . Denote h = 1 + 

We’ll call a vertex u in the Poisson(d) tree good if its subtree to depth 2 h is 


2 log 7 —4 


i°g(i— 7 / 2 ) 

(1 - e)d, (1 + e)d]- 

regular. Note that for a Poisson random variable X with mean d, and 0 < e < 1, the following 
Chernoff bound holds: 

Prob (\X -d\>ed)< 2e~ £2d/3 


(This follows, e.g., from [5J Theorem 5.4 and inequalities (4.2), (4.5)].) Applying this to the vertex 
degrees in the subtree of depth 2 h rooted at u , and taking a union bound, we find 


Prob (u is good) > 1 — 2 ((1 + e)d ) 2/i+1 e £2 ^ 3 > 1 — e e2rf / 4 


for all sufficiently large d. 

Lemma 7.9. If u is a good vertex then, subject to any boundary condition at least 2 h levels below 
u, we have 1 — a u < V . 
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Proof. Since the tree of depth 2 h rooted at u has even depth, a u is minimized when all its de- 
scendents at depth 2 h below it are set to 0. Let a° be this minimum value, and let a! u be the 
non-occupation probability at u of the ((1 — e)d, (1 + e)d) alternating tree of height 2 h rooted at 
u, when all its leaves are set to 0 . 



A = c((l + s)d)( 1 ~ £ ^ d ^ 


(1 — e)d + 1 


> c 


> 


1 


(1 + s)d y 
c e 


(1 — e)d{ 1 + e)d — c 
/ (I -e)d + l A (1 ~ e)d+1 


( 1 — e)d+l 


(1 — e)d 


(1 + e)d 


whence it follows that 


c < 


(1 + e)d\ 


= (l- 7 )(l + e)< 1 - 7 / 2 , 


since by definition, e = 7 2 / 4 . By our choice of h, it follows that c h 1 e < y 2 /8 = e/2. 


Since a' u = /(i_ e )d ° /(i+ £ )d(0), by Lemma 7.6 it follows that 


a' u — a*\ < c h 1 ln(l + A) 
< c h - l X 

c h ~ 1 e( 1 — 7 ) 
= d 

<^P-. 

(1 + s)d 


Rearranging terms, we see that 


> a° u > a' u > a* 


e /2 1 + ^e 

(1 + e)d (l-fe)d’ 


Finally, 


1 Q>u — 


1 + \e 

(1 + s)d 


whence the lemma follows. 


1 

d 



2(1 + e) 


e s/3 

< - 

“ d 


M 


Consider any path P from the root to a leaf at depth R in the truncated Poisson(d) tree. Fix 
j G {0,1,..., 2 h — 1}. Let Pj = {u G P\ depth(u) = j (mod 2 h)}. For u G Pj, the events that u is 
bad are independent. 

Let Xp j denote the number of bad u in Pj. Then E Xpj < (i?/(2/i))e _£ “ d ' /4 , and by Chernoff’s 
bound, for any a > 1 , 


Pr 


v R 
Xp j ^ a 2k e 



a(R/(2h))e ~ e2d / 4 
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Choosing a = ee £2<i / 4 /41og(d), which is exponential in d, we see that the right hand side becomes 


e \ eR/Sh log(d) 

a / 

In particular, for sufficiently large d, this is less than 

c ~e 3 di?/40/ilog(d) 

This is so tiny that, even if we take a union bound over all R, all j < 2 h, and the “first” d R 
paths of length R from the root, the resulting probability bound still can be made smaller than 
5/2. 

Applying Markov’s inequality to the expected number of nodes at depth R, we get that, with 
probability > 1 — 5/2, there are at most ^j-d R of these, for R > Rq(5). Thus, our union bound 
actually covered all the vertices at depth R. 

Let Xp = Y/j Xpj denote the total number of bad nodes on the path P. Assuming the above 
“good” event, we have for all R > Ro, and all paths P of length R, that Xp < aRexp(— e 2 d/4). 
Let w denote the leaf at depth R on P and v denote the root. Recall that 


da v 


da v 


< 


(i+n (i _ 


By Lemma 7.9 we have 


da v 


da,. 


< 


s / 3 d 


R-Xp 


< 


e / 3 d ) 


ueP 


R(l— aexp(- e 2 d/4)) 


£ / 3 J 


-R(l-e/41og(d)) 


< d R exp(—eR/3 + eR/4 + e 2 R/12 log(d)) 


< d R exp(—ei?/13). 


By (| 8 j), it follows that 


|a® — al\ — \9T R \d R exp(— eR/12>) < exp(—ei?/14), 


as desired, again assuming our good event, and noting that this implied |<9 Tr| < d R poly(R). This 


completes the proof of Theorem 7.8 


Theorem 7.8 says that for any 7 E (0,1), for sufficiently large d the Poisson(d) tree with activity 
— exhibits weak spatial mixing at the root, with probability 1. In other words, with probability 
1, there is a well-defined value a v , where v is the root. Moreover, since each node w is the root 
of its own Poisson(d) subtree, whose structure determines a w , and there are only countably many 
nodes, it follows that, with probability 1 , every node w has a well-defined value a w . 

Since a w is the probability that w is unoccupied, conditioned on its parent p(ui) being unoccu¬ 
pied, it follows that the occupation probabilities satisfy the recurrence 


Pr(u> E X) = (1 - a l0 )(l - Pr(p(io) E A)), 

and hence, by induction on depth(u;), these probabilities are well-defined, i.e. the Poisson(d) 
tree exhibits weak spatial mixing at all vertices, with probability 1. This completes the proof of 
Theorem iTTl 
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